feat(kernel): add SentenceBuffer for streaming text chunking (#1205) by crrow · Pull Request #1210 · rararulab/rara

crrow · 2026-04-09T02:42:43Z

Summary

Add SentenceBuffer — a pure text segmentation utility that accumulates streaming TextDelta chunks and emits complete sentences on sentence-ending punctuation (。！？.!?\n).

No async, no I/O, no TTS dependency. Designed to sit between an LLM streaming output and a TTS synthesizer for sentence-by-sentence voice reply (#1206).

Handles Chinese/English mixed text
Consecutive delimiters are collapsed (?! ends one sentence, not two)
flush() drains unterminated text at turn end
9 unit tests
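
The behaviour listed above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the method name `push`, the `pending` field, the `is_delimiter` helper, the return types, and the choice to trim whitespace and keep the trailing punctuation on each emitted sentence are all assumptions. Only the `SentenceBuffer` name, `flush()`, and the delimiter set come from the description.

```rust
/// Hypothetical sketch of a sentence buffer; not the implementation in this PR.
#[derive(Debug, Default)]
pub struct SentenceBuffer {
    /// Text received so far that has not been emitted yet (assumed field name).
    pending: String,
}

/// Sentence-ending punctuation listed in the PR summary.
fn is_delimiter(c: char) -> bool {
    matches!(c, '。' | '！' | '？' | '.' | '!' | '?' | '\n')
}

impl SentenceBuffer {
    pub fn new() -> Self {
        Self::default()
    }

    /// Append one streamed chunk and return every sentence it completes.
    /// (`push` is an assumed name; the PR does not show the real signature.)
    pub fn push(&mut self, delta: &str) -> Vec<String> {
        self.pending.push_str(delta);
        let mut sentences = Vec::new();
        let mut start = 0; // byte offset where the next sentence begins
        let mut chars = self.pending.char_indices().peekable();

        while let Some((idx, c)) = chars.next() {
            if !is_delimiter(c) {
                continue;
            }
            // Collapse a run of delimiters such as "?!" into a single boundary.
            let mut end = idx + c.len_utf8();
            while let Some(&(next_idx, next)) = chars.peek() {
                if !is_delimiter(next) {
                    break;
                }
                end = next_idx + next.len_utf8();
                chars.next();
            }
            let sentence = self.pending[start..end].trim();
            // Skip segments that hold only punctuation or whitespace.
            if sentence.chars().any(|ch| !is_delimiter(ch) && !ch.is_whitespace()) {
                sentences.push(sentence.to_string());
            }
            start = end;
        }

        // Keep only the unterminated tail for the next chunk.
        self.pending = self.pending.split_off(start);
        sentences
    }

    /// Drain whatever unterminated text is left, e.g. at the end of a turn.
    pub fn flush(&mut self) -> Option<String> {
        let rest = self.pending.trim().to_string();
        self.pending.clear();
        if rest.is_empty() {
            None
        } else {
            Some(rest)
        }
    }
}
```

In this sketch a caller would feed each TextDelta to `push`, hand every returned sentence to the synthesizer, and call `flush()` once when the turn ends. Because the buffer holds only a `String`, it needs no async runtime or I/O, which matches the "pure utility" framing in the summary.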

Type of change

Type	Label
New feature	`enhancement`

Component

core

Closes

Closes #1205

Test plan

cargo test -p rara-kernel --lib sentence_buffer — 9/9 pass
Pre-commit hooks (check, fmt, clippy, doc) all pass

Pure text segmentation utility that accumulates TextDelta chunks and emits complete sentences split on sentence-ending punctuation (。！？.!?\n). No async, no I/O — designed to sit between an LLM streaming output and a TTS synthesizer. Handles Chinese/English mixed text, consecutive delimiters, incremental deltas, and trailing unterminated text via flush(). 9 unit tests covering all edge cases. Closes #1205

crrow added the enhancement (New feature or request) and core (Core system changes) labels Apr 9, 2026

crrow added this to Rara Roadmap Apr 9, 2026

crrow wants to merge 1 commit into main from issue-1205-sentence-buffer
